High-level Control of Autonomous Robots Using a Behavior-based Scheme and Reinforcement Learning

Authors

  • M. Carreras
  • J. Yuh
  • J. Batlle
Abstract

This paper proposes a behavior-based scheme for high-level control of autonomous robots. The control scheme has two main characteristics. First, behavior coordination is done through a hybrid methodology, which takes advantage of the robustness and modularity of competitive approaches as well as the optimized trajectories of cooperative ones. Second, the state/action mapping of each behavior is learned by means of Reinforcement Learning (RL). A new continuous approach to the Q_learning algorithm, implemented with a multi-layer neural network, is used. The behavior-based scheme attempts to fulfill simple missions in which several behaviors/tasks compete for the vehicle's control. This paper focuses on the RL-based behaviors. To test the feasibility of the proposed Neural-Q_learning scheme, real experiments were carried out with the underwater robot ODIN in a target-following behavior. The results showed the convergence of the behavior to an optimal state/action mapping. A discussion of the proposed approach is given, as well as an overall description of the high-level control scheme. Copyright © 2002 IFAC
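
The abstract does not give the network structure, reward function, or vehicle dynamics, so the following is only a minimal sketch of the general idea it names: Q-learning in which a small multi-layer neural network, rather than a table, approximates Q(state, action) over a continuous state. Everything in it is an assumption made for illustration, including the 1-D `step` dynamics, the three-valued reward, the `ACTIONS` set, and the `NeuralQ` and `train` names. It is not the authors' Neural-Q_learning implementation and has no connection to the ODIN experiments.

```python
import numpy as np

ACTIONS = (-1.0, 0.0, 1.0)  # hypothetical velocity commands


def step(offset, action):
    """Hypothetical 1-D target-following dynamics and reward (not from the paper)."""
    nxt = float(np.clip(offset - 0.1 * action, -1.0, 1.0))
    if abs(nxt) < 0.1:
        reward = 1.0       # target roughly centered
    elif abs(nxt) > 0.6:
        reward = -1.0      # target almost lost
    else:
        reward = 0.0
    return nxt, reward


class NeuralQ:
    """One-hidden-layer network approximating Q(state, action)."""

    def __init__(self, n_hidden=8, lr=0.02, gamma=0.9, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 0.5, (n_hidden, 2))  # inputs: [state, action]
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(0.0, 0.5, n_hidden)
        self.b2 = 0.0
        self.lr, self.gamma = lr, gamma

    def q(self, state, action):
        x = np.array([state, action], dtype=float)
        h = np.tanh(self.W1 @ x + self.b1)
        return float(self.W2 @ h + self.b2)

    def best_action(self, state):
        # Greedy action: evaluate the network once per candidate action.
        return max(ACTIONS, key=lambda a: self.q(state, a))

    def update(self, s, a, r, s_next):
        # Q-learning target: r + gamma * max_a' Q(s', a').
        target = r + self.gamma * max(self.q(s_next, b) for b in ACTIONS)
        x = np.array([s, a], dtype=float)
        h = np.tanh(self.W1 @ x + self.b1)
        err = target - (self.W2 @ h + self.b2)
        # Semi-gradient step: the target is treated as a constant.
        grad_pre = err * self.W2 * (1.0 - h ** 2)
        self.W2 += self.lr * err * h
        self.b2 += self.lr * err
        self.W1 += self.lr * np.outer(grad_pre, x)
        self.b1 += self.lr * grad_pre
        return float(err)


def train(episodes=200, steps=30, epsilon=0.2, seed=0):
    """Epsilon-greedy training loop on the toy environment."""
    rng = np.random.default_rng(seed)
    agent = NeuralQ(seed=seed)
    for _ in range(episodes):
        s = float(rng.uniform(-1.0, 1.0))
        for _ in range(steps):
            if rng.random() < epsilon:
                a = ACTIONS[rng.integers(len(ACTIONS))]
            else:
                a = agent.best_action(s)
            s_next, r = step(s, a)
            agent.update(s, a, r, s_next)
            s = s_next
    return agent


if __name__ == "__main__":
    agent = train()
    for s in (-0.8, -0.3, 0.0, 0.3, 0.8):
        print(f"offset {s:+.1f} -> greedy action {agent.best_action(s):+.1f}")
```

Feeding the action to the network as an input, instead of giving the network one output per action, is one common way to keep the state continuous while still taking the maximum over a small set of candidate actions. Whether the paper uses this arrangement cannot be determined from the abstract alone.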

Similar resources

Clay: Integrating Motor Schemas and Reinforcement Learning

Clay is an evolutionary architecture for autonomous robots that integrates motor schema-based control and reinforcement learning. Robots utilizing Clay benefit from the real-time performance of motor schemas in continuous and dynamic environments while taking advantage of adaptive reinforcement learning. Clay coordinates assemblages (groups of motor schemas) using embedded reinforcement learning...

Using BELBIC based optimal controller for omni-directional three-wheel robots model identified by LOLIMOT

In this paper, an intelligent controller is applied to control omni-directional robots motion. First, the dynamics of the three wheel robots, as a nonlinear plant with considerable uncertainties, is identified using an efficient algorithm of training, named LoLiMoT. Then, an intelligent controller based on brain emotional learning algorithm is applied to the identified model. This emotional l...

Rapid Reinforcement Learning for Reactive Control Policy Design in Autonomous Robots

This paper describes work in progress on a neural-based reinforcement learning architecture for the design of reactive control policies for an autonomous robot. Reinforcement learning techniques allow a programmer to specify the control program at the level of the desired behavior of the robot, rather than at the level of the program that generates the behavior. In this paper, we explicitly beg...

A Distributed Adaptive Control Architecture for Autonomous Agents

Recently considerable interest in behavior-based robots has been generated by industrial, space and defence related activities. Such independent robots are envisioned to perform tasks where safety or economic factors prevent direct human control and communication difficulties prevent easy remote control. Although many successes have been reported using behavior-based robots with prespecified sk...

Guest editorial: Special issue on robot learning, Part A

Creating autonomous robots that can assist humans in unpredictable situations of daily life has been a long standing vision of robotics, artificial intelligence, and the cognitive sciences. With the current rise of physical humanoid and other highly mechanically capable robots in robotics research labs around the globe, we have come a step closer to this aim. Thus, it has become essential to cr...

Publication date: 2002